Standards in Genomic Sciences (2014) 9:1046-1061 



DOI:10.4056/sigs.5459590 



Non-contiguous finished genome sequence and description of 
Bacillus algeriensis sp. nov. 

Esma Bendjama^'^ Lotfi Loucif Seydina M. Diene\ Caroline Michelle', Djamila Gacemi- 
Kirane^ Jean-Marc Rolain'* 

' Unite de recherche sur les maladies infectieuses et tropicales emergentes (URMITE), UMR 
CNRS, IHU Mediterranee Infection, Faculte de Medecine et de Pharmacie, Aix-Marseille- 
Universite, Marseille, France. 

^ Departement de Biochimie, Faculte des Sciences, Universite Badji Mokhtar, Annaba, Alge- 
rie 

^ Departement des Sciences Biologiques, Faculte des Sciences, Universite El Hadj Lakhdar, 
Batna, Algerie. 

*Correspondence: Jean-Marc Rolain (jean-marc.rolain@univ-amu.fr) 

Keywords: Bacillus algeriensis, hypersaline environments, sediments, genome, taxono- 
genomics 



Strain EBOI^ sp. nov. is the type strain of Bacillus algeriensis, a new species within the genus 
Bacillus. This strain, whose genome is described here, was isolated from sediment sample of 
the hypersaline lake Ezzemoul sabkha in northeastern Algeria. B. algeriensis is a facultative 
anaerobic Gram-positive bacillus. Here we describe the features of this organism, together 
with the complete genome sequence and annotation. The 5,269,577 bp long genome con- 
tains 5,098 protein-coding and 95 RNA genes, including 12 rRNA genes. 



Introduction 



Bacillus algeriensis sp. nov. strain EBOIt [= CSUR 
P857 = DSM 27334) is the type strain of B. 
algeriensis sp. nov. It is a new Gram-positive, 
facultatively anaerobic, motile, indole-negative, 
rod shaped bacterium with rounded ends. It was 
isolated from a sediment sample from the 
hypersaline lake Ezzemoul sabkha in the Oum-El- 
Bouaghi region in northeastern Algeria, which is 
an important wintering and resting site for several 
species of waterbirds, including the Greater Fla- 
mingo. This site is one of the Ramsar convention 
wetlands [http://www.ramsar.org). The genus 
Bacillus was created by Cohn about 142 years ago 
[l],and mainly comprises Gram-positive, rod- 
shaped, aerobic or facultatively anaerobic, spore- 
forming bacteria. The genus includes 279 species 
and 7 subspecies with validly published names 
[2]. Members of Bacillus genus are ubiquitous in 
nature, ranging from freshwater to marine sedi- 
ments and from hot springs and desert sands to 
Arctic soils; many strains have been isolated from 
the gastrointestinal tracts of various insects and 
animals, from vegetation and from food [3]. Bacil- 
lus strains are biotechnologically priceless be- 



cause of their high capacity to produce a wide 
range of antimicrobial compounds, enzymes and 
other metabolites that can be used in industry 
[4,5]. Some species of Bacillus are pathogenic, 
such as B. anthracis [responsible for causing an- 
thrax) [6] and B. cereus [a major cause of food poi- 
soning) [7]. Others are opportunists in 
immunocompromised patients, and may also be 
involved in various human infections, including 
pneumonia, endocarditis, ocular, cutaneous, bone 
or central nervous system infections and bacte- 
remia [8] .The current bacterial taxonomy is based 
on a combination of various phenotypic and ge- 
netic criteria [9,10]. However, the three essential 
genetic criteria that are used, comprising 16S 
rRNA gene based phylogeny [11], G+C content, 
and DNA-DNA hybridization [10,12] exhibit sev- 
eral drawbacks. As a result of the recent decrease 
in the cost of genomic sequencing, it has been 
proposed that whole genome sequencing infor- 
mation and MALDI-TOF spectrum [13] be com- 
bined with the main phenotypic characteristics as 
a polyphasic approach strategy [taxono-genomics) 
to describe new bacterial taxa [14-26]. 
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Here we present a summary classification and a 
set of features for B. algehensis sp. nov. strain 
EBOl''' together with the description of the com- 
plete genome sequence and annotation. These 
characteristics support the circumscription of the 
species B. algeriensis. 

Classification and features 

In July 2012, a sediment sample was aseptically 
collected in sterile bottles, 15 cm below the 
evaporite crust of the hypersaline lake Ezzemoul 
sabkha of Oum-El-Bouaghi region in northeastern 
Algeria. Samples were transferred in a cooler 
(4°C] to our lab in Algeria. Samples were pro- 
cessed the same day. Sediments were diluted 1:10 
v/v with sterile saline water (0.9% NaCl) and vig- 
orously shaken, tenfold serial dilutions (lO-i-lO-s) 



of the sediment suspension were plated in Nutri- 
ent Agar (NA) medium (meat extract 1 g/1, pep- 
tone 5 g/1, yeast extract 2 g/1, sodium chloride 5 
g/1, agar 15 g/1] and the plates were incubated at 
30°C for 24-72 h. In order to obtain a pure culture, 
colonies were transferred to fresh NA medium. 
Bacillus algeriensis sp. nov. strain EBOl''' (Table 1) 
was isolated in July 2012 by cultivation under 
aerobic conditions at 30°C. This strain exhibited a 
97.0% 16S rRNA nucleotide sequence similarity 
with Bacillus subterraneus type strain DSM13966T 
(Figure 1), the phylogenetically closest validly 
pubHshed Bacillus species. These values were 
lower than the 98.7% 16S rRNA gene sequence 
threshold recommended by Stackebrandt and 
Ebers to delineate a new species without carrying 
DNA DNA hybridizidation [11]. 



Table 1 . CI; 


assification and general features 


of Bacillus algeriensis strain EBOI^ 




MIGS ID 


Property 


Term 


Evidence code^ 






Domain Bacteria 


TAS [27] 






Phylum Firmicutes 


TAS [28-30] 






Class Bacilli 


TAS [31,32[ 




Current classification 


Order Bacillales 


TAS [33,34] 






Family Bacillaceae 


TAS [33,35] 






Genus Bacillus 


TAS [1,33,361 






Species Bacillus algeriensis 


IDA 






Type strain EB01 ^ 


IDA 




Gram stain 


Positive 


IDA 




Cell shape 


Rod-shaped 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Sporulating 


IDA 




Temperature range 


Between 37°C and 55°C 


IDA 




Optimum temperature 37°C 


IDA 






Growth in LB medium + 0-2.5% 




MIGS-6.3 


Salinity 


NaCI 


IDA 


MIGS-22 


Oxygen requirement 


Facultative anaerobic 


IDA 




Carbon source 


Unknown 


NAS 




Energy source 


Unknown 


NAS 


MIGS-6 


Habitat 


Hypersaline sediment sample 


IDA 


MlGS-15 


Biotic relationship 


Free living 


IDA 




Pathogenicity 


Unknown 


NAS 




Biosafety level 


2 


NAS 


MlGS-14 


Isolation 


Sediment of Ezzemoul Sabkha Lake 


IDA 


MlGS-4 


Geographic location 


Algeria 


IDA 




Sample collection 


July 2012 




MlGS-5 


time 




IDA 


MIGS-4.1 


Latitude 


35.88167 


IDA 


MIGS-4.1 


Longitude 


6.503272 


IDA 


MlGS-4.3 


Depth 


Unknown 


NAS 


MIGS-4.4 


Altitude 


800 m 


IDA 



^Evidence codes - IDA: Inferred from Direct Assay, TAS: Traceable Author Statement (i.e., a direct report exists in the 
literature), NAS: Non traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based 

on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene 
Ontology project [37[. If the evidence is IDA, then the property was directly observed for a live isolate by one of the 
authors or an expert mentioned in the acknowledgements. 
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Figure 1. A consensus phylogenetic tree based on 16S rRNA gene sequence comparisons, highlighting the position of 
strain EBOI^ Bacillus algeriensis relative to other type strains within the Bacillus genus. GenBank accession numbers 
are displayed in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences made using the 
neighbor-joining method [38] within the MEGA 5 software [39]. Numbers above the nodes are percentages of boot- 
strap values from 1 ,000 replicates that support the node. Paenibacillus polymyxa was used as the outgroup. The scale 
bar represents 0.01 substitutions per nucleotide position. 



Six different growth temperatures [25, 30, 37, 45, 
50 and 55°C), nine NaCl concentrations (0, 2.5, 5, 
7.5, 10, 15, 20, 25, 30%) and ten pHs (5, 6, 6.5, 7, 
7.5, 8, 8.5, 9, 10, 11) were tested. Growth occurred 
at all tested temperatures, however the optimal 
growth was observed at 37°C, between 0% and 
2.5% NaCl concentration and pH in the range of 
6.5-9 (optimum at pH 7). Colony morphology was 
observed on sheep blood agar [BioMerieux) after 
24 h of aerobic incubation under optimal growth 
conditions, the colonies of strain EBOl^ were cir- 
cular, light yellow, smooth and 2 mm in diameter. 
Growth of the strain was tested in anaerobic and 
microaerophiUc atmospheres using GasPak EZ 
Anaerobe Pouch [Becton, Dickinson and Compa- 



ny) and CampyGen Compact (Oxoid) systems, re- 
spectively, and in aerobic atmosphere, with or 
without 5% CO2. Growth was achieved under aer- 
obic (with and without C02) and microaerophilic 
conditions but weak growth was observed under 
anaerobic conditions. Gram staining showed 
Gram-positive rods [Figure 2). Cells grown on 
agar sporulate. A motility test was positive. The 
size of cells were determined by negative staining 
transmission electron microscopy on a Technai G^ 
Cryo [FEI) at an operating voltage of 200 kV, the 
rods have a length ranging from 2.4 ^m to 4.9 |im 
[mean 3.6 |im) and a diameter ranging from 0.7 
\im to 1.1 |im [mean 0.8 |im) [Figure 3). 
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Strain EBOIt exhibited catalase activity but oxi- 
dase activity was negative. Using the commercially 
available API 50CH system (BioMerieux) accord- 
ing to the manufacturer's instructions, a weak 
positive reaction was observed for D-ribose, D- 
glucose, D-fructose, methyl a-D-glucopyranoside, 
N-acetylglucosamine, D-maltose, D-lactose, D- 
melibiose, D-saccharose, D-trehalose, D-tagatose, 
and hydrolysis of starch. Other tests were nega- 
tive. Using the API ZYM system (BioMerieux), pos- 
itive reactions were observed for alkaline phos- 
phatase, esterase (C4), esterase lipase (C8), 
leucine arylamidase, a-chymotrypsin, (B-glucuron- 
idase, a-glucosidase, N-acetyl-glucosaminidase and 
a weak positive reaction was observed for acid 
phosphatase. The nitrate reduction and (B- 
galactosidase reaction was also positive, but ure- 
ase and indole production were negative. B. algeri- 
ensis was susceptible to amoxicillin, nitrourantoin, 
erythromycin, doxycycline, rifampicin, vanco- 
mycin, gentamicin, imipenem, trimethoprim-sulfa- 
methoxazole, ciprofloxacin, ceftriaxone and amox- 
icillin-clavulanic acid, but resistant to nalidixic acid. 

When compared to other Bacillus species [40-48], 
Bacillus algeriensis sp. nov. strain EBOIt exhibited 
the phenotypic differences detailed in (Table 2). 

Matrix-assisted laser-desorption/ionization time- 
of-flight (MALDI-TOF) MS protein analysis was 
performed as previously described [26,49,50]. 
Briefly, strain EBOl^ was plated on 5% sheep 
blood-enriched Columbia agar (BioMerieux) and 
incubated for 24 h at 37°C. Isolated bacterial colo- 



Figure 3. Transmission electron micrograph of B. 
algeriensis strain EBOI^ made using a Technai Cryo 
(FEI) at an operating voltage of 200 kV. The scale bar 
represents 500 nm. 

nies were picked, and then deposited as a thin film 
in 12 replicates on a MALDI-TOF steel target plate 
(Bruker Daltonics, Bremen, Germany). The plates 
were allowed to dry at room temperature. Each 
deposit was overlaid with 1.5 ^1 of matrix solution 
containing a-cyano-hydroxycinnamic acid (Sigma, 
Saint-Quentin Fallavier, France) saturated with 
50% acetonitrile, 2.5% trifluoroacetic acid and 
high-performance liquid chromatography (HPLC)- 
grade water, and allowed to co-crystallize with the 
sample. Measurements were conducted using the 
Microflex LT spectrometer (Bruker Daltonics). 
Spectra were recorded in the linear positive ion 
mode over a mass range of 2 to 20 kDa. The accel- 
eration voltage was 20 kV. Spectra were collected 
as a sum of 240 shots across a spot. The 12 EBOIt 
spectra were imported into the MALDI BioTyper 
software (version 3.0, Bruker) and analyzed by 
standard pattern matching (with default parame- 
ter settings) against 6,335 bacterial spectra in- 
cluding 210 spectra from 104 Bacillus species, 
used as reference data, in the BioTyper database. 
A score enabled the identification, or not, from the 
tested species: a score > 2 with a vaUdated species 
enabled the identification at the species level, a 
score > 1.7 but < 2 enabled the identification at 
the genus level; and a score < 1.7 did not enable 
any identification. For strain EBOIt, the scores ob- 
tained ranged from 1.15 to 1.60 thus suggesting 
that our isolate was a new species. We added the 
spectrum from strain EBOIt (Figure 4) to our da- 
tabase. Spectrum differences with other of Bacillus 
species are shown in (Figure 5). 
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Figure 4. Reference mass spectrum from B. algeriensis strain EBOl^. Spectra from 12 
individual colonies were compared and a reference spectrum was generated. 
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Figure 5. Gel view comparing Bacillus algeriensis EBOI^ spectra with other members of the Bacillus genus (6. 
weihenstephanensis, B. vallismortis, B. thuringiensis, B. thioparans, B. subtilis subsp. subtilis, B. subterraneus, B. 
shackletonii, B. novalis, B. niacini, B. nealsonii, B. jeotgali, B. flexus, B. circulans, B. bataviensis and B. asahii). 
The Gel View displays the raw spectra of all loaded spectrum files as a pseudo-electrophoretic gel. The x-axis 
records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra 
loading. The peak intensity is expressed by a grey scale scheme code. The grey scale bar on the right y-axis indi- 
cates the relation between the shade of grey a peak is displayed with and the peak intensity in arbitrary units. 
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Genome sequencing information 

Genome project history 

The organism was selected for sequencing on the 
basis of its phylogenetic position and 16S rRNA 
similarity to other members of the genus Bacillus, 
and is part of a study of Bacillus genus diversity in 
hypersaline lakes of Algeria. It was the 398"^ ge- 



Table 3. Project information 



MIGS ID 


Property 


Term 


MIGS-31 


Finishing quality 


High-quality draft 


MIGS-28 


Libraries used 


Nextera XT library 


MlGS-29 


Sequencing platform 


Miseq-lllumina 


MIGS-31. 2 


Sequencing coverage 


34x 


MIGS-30 


Assemblers 


Velvet 


MlGS-32 


Gene calling method 


Prodigal 




EMBL Date of Release 


January 10, 2014 




EMBL ID 


ERP003483 






Study of the Bacillus genus diversity in 


MlGS-13 


Project relevance 


hypersaline lakes of northeastern Algeria 



nome of a Bacillus species and the first genome of 
Bacillus algeriensis sp. nov. The EMBL accession 
number is ERP003483 and consists of 46 contigs. 
Table 3 shows the project information and its as- 
sociation with MIGS version 2.0 compHance [51]. 



Growth conditions and DNA isolation 

Bacillus algeriensis sp. nov strain EBOl''', was 
grown aerobically on 5% sheep blood enriched 
Columbia agar at 37°C. Three Petri dishes were 
spread and resuspended in a 2 ml sterile 
Eppendorf tube containing 1ml of TE buffer with 
acid-washed glass beads (diameter <106 \im, Sig- 
ma, Saint-Quentin Fallavier, France). Three cycles 
of shaking were performed using a FastPrep BIO 
101 apparatus (Qbiogene, Strasbourg, France) for 
15 sec at level 6.5 (full speed). Then, the superna- 
tant was placed in a new tube along with one hun- 
dred \i\ of 10% SDS and 50 \i\ of Proteinase K 
(Qiagen GmbH, Hilden, Germany) and incubated 
over night at 56°C. The digested mixture was used 
to perform DNA extraction using the classical 
phenol-chloroform method. The quality of the 
DNA was checked on an agarose gel (0.8%) 
stained with SYBR safe. 

Genome sequencing 

Genomic DNA of B. algeriensis sp. nov. strain 
EBOIT was sequenced on the MiSeq platform 
(Illumina, Inc, San Diego CA 92121, USA) with a 
paired end and barcode strategy in order to be 
mixed with 7 others genomic projects constructed 
with the Nextera XT library kit (Illumina). 

The gDNA was quantified by a Qubit assay with 
the high sensitivity kit (Life technologies, Carls- 
bad, CA, USA) to 34.4 ng/|iL and dilution was per- 
formed to provide 1 ng of each small genome as 
input. The "tagmentation" step fragmented and 
tagged the DNA to generate an optimum insert 



size of 1.6 kb, validated on a high sensitivity 
labchip Calliper-Perkin Elmer (Caliper Life Scienc- 
es, Inc, Massachusetts, USA). Then limited cycle 
PGR amplification completed the tags adapters 
and introduced dual-index barcodes. After purifi- 
cation on Ampure beads (Lifetechnolgies, Carls- 
bad, CA, USA), the libraries were normalized on 
specific beads according to the Nextera XT proto- 
col (Illumina). Normalized libraries are pooled in- 
to a single library for sequencing on the MiSeq. 
The pooled single strand library was loaded onto 
the reagent cartridge and then onto the instru- 
ment along with the flow cell. Automated cluster 
generation and paired-end sequencing with dual 
index reads was performed in a single 39-hour 
run with a 2x250 bp read length. Within this 
pooled run, the index representation was deter- 
mined to 7.1%. Total information of 2.4 G bases 
was obtained from a 320 K/mm2 density with 
94.9% (5,757,000 clusters) of the clusters passing 
quality control (QC) filters. From the genome se- 
quencing process, the 775,420 produced Illumina 
reads for B. algeriensis EBOl^ were filtered ac- 
cording to the read qualities and sizes using the 
fastq-mcf program (Ea-utils: command-line tools 
for processing biological sequencing data) [52]. 
714,540 filtered read sequences were kept for ge- 
nome assembly. The Velvet assembler was used 
with different kmer values (from 51 to 95) and the 
best assembly result with kmer value (n=91) pro- 
ducing 46 contigs with sizes from 872 bp to 
409,112 bp, was retained for genome annotation. 
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Genome annotation 

Open Reading Frames [ORFs) were predicted us- 
ing Prodigal [53] with default parameters. The 
predicted bacterial protein sequences were 
searched against the Clusters of Orthologous 
Groups [COG) database and the GenBank database 
[54] using BLASTP. Ribosomal RNAs were found 
by using RNAmmer 2.1 server [55,56] and BLASTn 
against the GenBank database, whereas the 
tRNAScanSE tool [57] was used to find tRNA 
genes. Transmembrane helices and lipoprotein 
signal peptides were predicted using phobius web 
server [58]. ORFans were identified if their 
BLASTP E-value was lower than le-03 for align- 
ment length greater than 80 amino acids. If align- 
ment lengths were smaller than 80 amino acids, 
we used an E-va\ue of le-05. Artemis [59] was 
used for data management and DNA Plotter [60] 
was used for visualization of genomic features. To 
estimate the mean level of nucleotide sequence 
similarity at the genome level between B. 
algeriensis sp nov. strain EBOl^ and seven other 
Bacillus species, we use the Average Genomic 
Identity of Orthologous gene Sequences [AGIOS) 
in-house software. Briefly, this software combines 



the Proteinortho software [61] for pairwise com- 
parison and detection of orthologous proteins be- 
tween genomes, then retrieves the corresponding 
genes and determines the mean percentage of nu- 
cleotide sequence identity among orthologous 
ORFs using the Needleman-Wunsch global align- 
ment algorithm. 

Genome properties 

The genome is 5,269,577 bp long with 42.22% GC 
content [Figure 6 and Table 4). It is composed of 
46 contigs. Of the 5,193 predicted genes, 5,098 
were protein-coding genes, and 95 were RNAs [10 
genes encode 5S rRNA, 1 gene encodes 16S rRNA, 
1 gene encodes 23S rRNA, 83 genes are tRNA 
genes). A total of 3,217 genes [63.1%) were as- 
signed a putative function [by cogs or by NR 
blast). 457 genes were identified as ORFans 
[8.96%). The remaining genes were annotated as 
hypothetical proteins [1,097 genes, 21.52%). The 
distribution of genes into COGs functional catego- 
ries is presented in Table 5. The properties and 
statistics of the genome are summarized in Tables 
4 and 5. 
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Figure 6. Graphical circular map of the chromosome. From outside to the center: Red and 
gray bars representing contigs, genes on the forward strand colored by COG categories (only 
genes assigned to COG), genes on the reverse strand colored by COG categories (only genes 
assigned to COG), RNA genes (tRNAs green, rRNAs red), GC content. The inner-most circle 
shows the GC skew, purple and olive indicating negative and positive values, respectively. 
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Table 4. Nucleotide content and gene count levels of the genome 



Attribute 


Value 


% of total" 


Genome size (bp) 


5,269,577 


100 


Coding region (bp) 


4,342,253 


82.4 


G+C content (bp) 


2,224,760 


42.22 


Total genes 


5,193 


100 


RNA genes 


95 


1 .82 


Protein-coding genes 


5,098 


98.1 7 


Genes with function prediction 


3,217 


63.1 


Genes assigned to COGs 


3,041 


59.65 


Genes with peptide signals 


653 


12.8 


Genes with transmembrane helices 


1,297 


25.44 



" The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the 

annotated genome 



Table 5. Number of genes associated with the 25 general COG functional categories 



Code 


Value 


% age" 


Description 


J 


176 


3.45 


Translation, ribosomal structure and biogenesis 


A 


0 


0 


RNA processing and modification 


K 


281 


5.51 


Transcription 


L 


165 


3.23 


Replication, recombination and repair 


B 


1 


0.01 


Chromatin structure and dynamics 


D 


33 


0.64 


Cell cycle control, mitosis and meiosis 


Y 


0 


0 


Nuclear structure 


V 


63 


1.23 


Defense mechanisms 


T 


144 


2.82 


Signal transduction mechanisms 


M 


156 


3.06 


Cell wall/membrane biogenesis 


N 


48 


0.94 


Cell motility 


Z 


0 


0 


Cytoskeleton 


W 


0 


0 


Extracellular structures 


u 


50 


0.98 


Intracellular trafficking and secretion 


o 


no 


2.15 


Posttranslational modification, protein turnover, chaperones 


c 


206 


4.04 


Energy production and conversion 


G 


277 


5.43 


Carbohydrate transport and metabolism 


E 


370 


7.25 


Amino acid transport and metabolism 


F 


82 


1.60 


Nucleotide transport and metabolism 


H 


109 


2.13 


Coenzyme transport and metabolism 


1 


142 


2.78 


Lipid transport and metabolism 


P 


213 


4.17 


Inorganic ion transport and metabolism 


Q 


86 


1.68 


Secondary metabolites biosynthesis, transport and catabolism 


R 


521 


10.21 


General function prediction only 


S 


327 


6.41 


Function unknown 




2,057 


40.34 


Not in COGs 



" The total is based on the total number of protein coding genes in the annotated genome 
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Comparison with other species Bacil- 
lus genomes 

Here, we compared the genome of B. algeriensis 
strain EBOIt with those of B. kribbensis strain DSM 
17871, B. nealsonii strain AAUl, B. bataviensis 
strain LMG 21833, B. subtilis subsp. subtilis strain 
168, B. vallismortis strain DVl-F-3, B. thuringiensis 
strain BMB171 and B. weihenstephanensis strain 
KBAB4 [Table 6). The draft genome of R 
algeriensis (5.26Mb) is larger in size than those of 
B. kribbensis, B. nealsonii, B. subtilis subsp. subtilis 
and B. vallismortis (5.05, 4.98, 4.22 and 3.88 Mb, 
respectively) but smaller than those of, B. 
bataviensis, B. thuringiensis and B. 
weihenstephanensis (5.37, 5.64 and 5.87 Mb, re- 
spectively). B. algeriensis has a lower G+C content 
than B. kribbensis, B. subtilis subsp. subtilis and B. 
vallismortis (42.22% vs 43%, 43.5% and 43.8%, 
respectively) but higher than B. nealsonii, B. 
bataviensis, B. thuringiensis and B. 
weihenstephanensis (42.22% vs 35.1%, 39.6%, 



35.2% and 35.5%, respectively). B. algeriensis has 
more predicted protein coding genes (5,098) than 
B. kribbensis, B. nealsonii, B. subtilis subsp. subtilis 
and B. vallismortis (4,918, 4,789, 4,175 and 4,097, 
respectively) but fewer protein coding genes than 
B. bataviensis,B. thuringiensis and B. 
weihenstephanensis (5,207, 5,352 and 5,653, re- 
spectively). In addition, B. algeriensis shared 
1,804, 1,778, 2,017, 1,768, 1,985, 1,541, 1,863 
orthologous genes with B. thuringiensis, B. 
nealsonii, B. bataviensis, B. subtiUs subsp. subtilis, B. 
kribbensis, B. vallismortis, B. weihenstephanensis 
respectively. 

The average nucleotide sequence identity of 
orthologous genes ranges from 64.54 to 91.06% 
among the 8 Bacillus species, and from 64.77 to 
69.33% between Bacillus algeriensis and the other 
compared genomes (Table 7), thus confirming its 
new species status. 



T able 6. Genomic comparison of B. algeriensis sp. nov. strain EBOI^ with seven other Bacillus species^. 

Genome accession Genome size 
Species Strain number (Mb) 



G+C 
content 



Bacillus algeriensis 



EBOV 



ERP003483 



5.26 



42.22 



Bacillus kribbensis 



DSM 17871 



AUMQ00000000.1 5.05 



43 



Bacillus nealsonii 



AAUl 



ASRU00000000.1 



4.98 



35.1 



Bacillus bataviensis 



LMG 21833 



AJLS00000000.1 



5.37 



39.6 



Bacillus subtilis subsp. subtilis 



168 



NC 000964.3 



4.22 



43.5 



Bacillus vallismortis 



DVl-F-3 



AFSH00000000.1 



3.88 



43.8 



Bacillus thuringiensis 



BMB171 



NC 014171.1 



5.64 



35.2 



Bacillus weihenstephanensis 



KBAB4 



NC 010184.1 



5.87 



35.5 



^ species and strain names, genome accession numbers, sizes and G+C contents. 
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Conclusion 

On the basis of phenotypic [Table 2), phylogenetic 
and genomic analyses (taxonogenomics) [Table 
6), we formally propose the creation of Bacillus 
algeriensis sp. nov. that contains the strain EBOIt. 
This strain has been found in hypersaline lacus- 
trine sediment sample collected from Algeria. 

Description of Bacillus algeriensis sp. nov. 

Bacillus algeriensis [al.ge.ri.en'sis. NL. masc.adj. 
algeriensis, of or pertaining to Algeria). Strain 
EBOIT is a facultative anaerobic Gram-positive, 
endospore-forming, motile and rod shaped bacte- 
rium with rounded ends. Growth is achieved aer- 
obically between 30 and 55°C (optimum 37°C), 
between 0% and 2.5% NaCl concentration and pH 
in the range of 6.5-9 [optimum at pH 7). Growth is 
also observed in microaerophilic atmosphere, 
however, weak growth was observed under an- 
aerobic conditions. After 24h growth on 5% sheep 
blood-enriched Columbia agar [BioMerieux) at 
37°C, bacterial colonies were smooth, light yellow 
with 2 mm in diameter. Cells have a length ranging 
from 2.4 |im to 4.9 |im [mean 3.6 |im) and a diam- 
eter ranging from 0.7 |im to 1.1 |im [mean 0.8 |im). 

Catalase positive but oxidase negative. Using the 
commercially available API 50CH system 
[BioMerieux) according to the manufacturer's in- 
structions, a weak positive reaction was observed 
for D-ribose, D-glucose, D-fructose, methyl a-D- 
glucopyranoside, N-acetylglucosamine, D-maltose, 
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